Overview

Dataset statistics

Number of variables26
Number of observations31465
Missing cells106534
Missing cells (%)13.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.0 MiB
Average record size in memory201.0 B

Variable types

NUM12
CAT8
DATE4
BOOL1
UNSUPPORTED1

Reproduction

Analysis started2021-06-22 20:40:38.187247
Analysis finished2021-06-22 20:41:11.329357
Duration33.14 seconds
Versionpandas-profiling v2.8.0
Command linepandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configurationconfig.yaml

Warnings

Status has constant value "5" Constant
PurchaseOrderNumber has a high cardinality: 3806 distinct values High cardinality
AccountNumber has a high cardinality: 19119 distinct values High cardinality
CreditCardApprovalCode has a high cardinality: 30334 distinct values High cardinality
ShipToAddressID is highly correlated with BillToAddressIDHigh correlation
BillToAddressID is highly correlated with ShipToAddressIDHigh correlation
ShipMethodID is highly correlated with OnlineOrderFlagHigh correlation
OnlineOrderFlag is highly correlated with ShipMethodIDHigh correlation
TaxAmt is highly correlated with SubTotal and 2 other fieldsHigh correlation
SubTotal is highly correlated with TaxAmt and 2 other fieldsHigh correlation
Freight is highly correlated with SubTotal and 2 other fieldsHigh correlation
TotalDue is highly correlated with SubTotal and 2 other fieldsHigh correlation
PurchaseOrderNumber has 27659 (87.9%) missing values Missing
SalesPersonID has 27659 (87.9%) missing values Missing
CreditCardID has 1131 (3.6%) missing values Missing
CreditCardApprovalCode has 1131 (3.6%) missing values Missing
CurrencyRateID has 17489 (55.6%) missing values Missing
Comment has 31465 (100.0%) missing values Missing
PurchaseOrderNumber is uniformly distributed Uniform
CreditCardApprovalCode is uniformly distributed Uniform
SalesOrderID has unique values Unique
SalesOrderNumber has unique values Unique
rowguid has unique values Unique
Comment is an unsupported type, check if it needs cleaning or further analysis Unsupported

Variables

SalesOrderID
Real number (ℝ≥0)

UNIQUE

Distinct count31465
Unique (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean59391.0
Minimum43659
Maximum75123
Zeros0
Zeros (%)0.0%
Memory size245.8 KiB

Quantile statistics

Minimum43659
5-th percentile45232.2
Q151525
median59391
Q367257
95-th percentile73549.8
Maximum75123
Range31464
Interquartile range (IQR)15732

Descriptive statistics

Standard deviation9083.307446
Coefficient of variation (CV)0.1529408066
Kurtosis-1.2
Mean59391
Median Absolute Deviation (MAD)7866
Skewness0
Sum1868737815
Variance82506474.17
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
675831< 0.1%
 
668581< 0.1%
 
750541< 0.1%
 
545761< 0.1%
 
566251< 0.1%
 
504821< 0.1%
 
525311< 0.1%
 
627721< 0.1%
 
648211< 0.1%
 
586781< 0.1%
 
Other values (31455)31455> 99.9%
 
ValueCountFrequency (%) 
436591< 0.1%
 
436601< 0.1%
 
436611< 0.1%
 
436621< 0.1%
 
436631< 0.1%
 
ValueCountFrequency (%) 
751231< 0.1%
 
751221< 0.1%
 
751211< 0.1%
 
751201< 0.1%
 
751191< 0.1%
 

RevisionNumber
Categorical

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size245.8 KiB
8
31435
9
 
30
ValueCountFrequency (%) 
83143599.9%
 
9300.1%
 

Length

Max length1
Median length1
Mean length1
Min length1
Distinct count1124
Unique (%)3.6%
Missing0
Missing (%)0.0%
Memory size245.8 KiB
Minimum2011-05-31 00:00:00
Maximum2014-06-30 00:00:00
Histogram
Distinct count1124
Unique (%)3.6%
Missing0
Missing (%)0.0%
Memory size245.8 KiB
Minimum2011-06-12 00:00:00
Maximum2014-07-12 00:00:00
Histogram
Distinct count1124
Unique (%)3.6%
Missing0
Missing (%)0.0%
Memory size245.8 KiB
Minimum2011-06-07 00:00:00
Maximum2014-07-07 00:00:00
Histogram

Status
Categorical

CONSTANT
REJECTED

Distinct count1
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size245.8 KiB
5
31465
ValueCountFrequency (%) 
531465100.0%
 

Length

Max length1
Median length1
Mean length1
Min length1

OnlineOrderFlag
Boolean

HIGH CORRELATION

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size30.7 KiB
True
27659
False
 
3806
ValueCountFrequency (%) 
True2765987.9%
 
False380612.1%
 

SalesOrderNumber
Categorical

UNIQUE

Distinct count31465
Unique (%)100.0%
Missing0
Missing (%)0.0%
Memory size245.8 KiB
SO71273
 
1
SO47468
 
1
SO50116
 
1
SO51643
 
1
SO55293
 
1
Other values (31460)
31460
ValueCountFrequency (%) 
SO712731< 0.1%
 
SO474681< 0.1%
 
SO501161< 0.1%
 
SO516431< 0.1%
 
SO552931< 0.1%
 
SO595941< 0.1%
 
SO469561< 0.1%
 
SO482621< 0.1%
 
SO712721< 0.1%
 
SO679381< 0.1%
 
Other values (31455)31455> 99.9%
 

Length

Max length7
Median length7
Mean length7
Min length7

PurchaseOrderNumber
Categorical

HIGH CARDINALITY
MISSING
UNIFORM

Distinct count3806
Unique (%)100.0%
Missing27659
Missing (%)87.9%
Memory size245.8 KiB
PO8091154181
 
1
PO9744171621
 
1
PO20097117736
 
1
PO18357170772
 
1
PO19169155150
 
1
Other values (3801)
3801
ValueCountFrequency (%) 
PO80911541811< 0.1%
 
PO97441716211< 0.1%
 
PO200971177361< 0.1%
 
PO183571707721< 0.1%
 
PO191691551501< 0.1%
 
PO148481549751< 0.1%
 
PO120931737991< 0.1%
 
PO192561795001< 0.1%
 
PO28421357311< 0.1%
 
PO181831166861< 0.1%
 
Other values (3796)379612.1%
 
(Missing)2765987.9%
 

Length

Max length13
Median length3
Mean length4.144700461
Min length3

AccountNumber
Categorical

HIGH CARDINALITY

Distinct count19119
Unique (%)60.8%
Missing0
Missing (%)0.0%
Memory size245.8 KiB
10-4030-011176
 
28
10-4030-011091
 
28
10-4030-011331
 
27
10-4030-011262
 
27
10-4030-011330
 
27
Other values (19114)
31328
ValueCountFrequency (%) 
10-4030-011176280.1%
 
10-4030-011091280.1%
 
10-4030-011331270.1%
 
10-4030-011262270.1%
 
10-4030-011330270.1%
 
10-4030-011277270.1%
 
10-4030-011300270.1%
 
10-4030-011287270.1%
 
10-4030-011276270.1%
 
10-4030-011200270.1%
 
Other values (19109)3119399.1%
 

Length

Max length14
Median length14
Mean length14
Min length14

CustomerID
Real number (ℝ≥0)

Distinct count19119
Unique (%)60.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20170.17568727157
Minimum11000
Maximum30118
Zeros0
Zeros (%)0.0%
Memory size245.8 KiB

Quantile statistics

Minimum11000
5-th percentile11492
Q114432
median19452
Q325994
95-th percentile29844
Maximum30118
Range19118
Interquartile range (IQR)11562

Descriptive statistics

Standard deviation6261.72896
Coefficient of variation (CV)0.310444939
Kurtosis-1.341699558
Mean20170.17569
Median Absolute Deviation (MAD)5583
Skewness0.1802670394
Sum634654578
Variance39209249.57
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
11091280.1%
 
11176280.1%
 
11300270.1%
 
11200270.1%
 
11262270.1%
 
11185270.1%
 
11223270.1%
 
11287270.1%
 
11711270.1%
 
11330270.1%
 
Other values (19109)3119399.1%
 
ValueCountFrequency (%) 
110003< 0.1%
 
110013< 0.1%
 
110023< 0.1%
 
110033< 0.1%
 
110043< 0.1%
 
ValueCountFrequency (%) 
301188< 0.1%
 
3011712< 0.1%
 
301164< 0.1%
 
301158< 0.1%
 
301148< 0.1%
 

SalesPersonID
Real number (ℝ≥0)

MISSING

Distinct count17
Unique (%)0.4%
Missing27659
Missing (%)87.9%
Infinite0
Infinite (%)0.0%
Mean280.6079873883342
Minimum274.0
Maximum290.0
Zeros0
Zeros (%)0.0%
Memory size245.8 KiB

Quantile statistics

Minimum274
5-th percentile275
Q1277
median279
Q3284
95-th percentile289
Maximum290
Range16
Interquartile range (IQR)7

Descriptive statistics

Standard deviation4.846964646
Coefficient of variation (CV)0.01727308154
Kurtosis-0.8639785694
Mean280.6079874
Median Absolute Deviation (MAD)3
Skewness0.6426601181
Sum1067994
Variance23.49306628
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
2774731.5%
 
2754501.4%
 
2794291.4%
 
2764181.3%
 
2893481.1%
 
2822710.9%
 
2812420.8%
 
2782340.7%
 
2831890.6%
 
2901750.6%
 
Other values (7)5771.8%
 
(Missing)2765987.9%
 
ValueCountFrequency (%) 
274480.2%
 
2754501.4%
 
2764181.3%
 
2774731.5%
 
2782340.7%
 
ValueCountFrequency (%) 
2901750.6%
 
2893481.1%
 
2881300.4%
 
287390.1%
 
2861090.3%
 

TerritoryID
Real number (ℝ≥0)

Distinct count10
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.090767519466073
Minimum1
Maximum10
Zeros0
Zeros (%)0.0%
Memory size245.8 KiB

Quantile statistics

Minimum1
5-th percentile1
Q14
median6
Q39
95-th percentile10
Maximum10
Range9
Interquartile range (IQR)5

Descriptive statistics

Standard deviation2.958119227
Coefficient of variation (CV)0.4856726541
Kurtosis-1.083345417
Mean6.090767519
Median Absolute Deviation (MAD)2
Skewness-0.3904916518
Sum191646
Variance8.750469361
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
9684321.7%
 
4622419.8%
 
1459414.6%
 
6406712.9%
 
10321910.2%
 
726728.5%
 
826238.3%
 
54861.5%
 
33851.2%
 
23521.1%
 
ValueCountFrequency (%) 
1459414.6%
 
23521.1%
 
33851.2%
 
4622419.8%
 
54861.5%
 
ValueCountFrequency (%) 
10321910.2%
 
9684321.7%
 
826238.3%
 
726728.5%
 
6406712.9%
 

BillToAddressID
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count19119
Unique (%)60.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean18263.154425552202
Minimum405
Maximum29883
Zeros0
Zeros (%)0.0%
Memory size245.8 KiB

Quantile statistics

Minimum405
5-th percentile698.2
Q114080
median19449
Q324678
95-th percentile28863
Maximum29883
Range29478
Interquartile range (IQR)10598

Descriptive statistics

Standard deviation8210.069158
Coefficient of variation (CV)0.449542777
Kurtosis-0.01743750554
Mean18263.15443
Median Absolute Deviation (MAD)5299
Skewness-0.8240474389
Sum574650154
Variance67405235.58
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
13550280.1%
 
25073280.1%
 
21596270.1%
 
13890270.1%
 
14444270.1%
 
21639270.1%
 
27756270.1%
 
27048270.1%
 
29638270.1%
 
22168270.1%
 
Other values (19109)3119399.1%
 
ValueCountFrequency (%) 
4054< 0.1%
 
4064< 0.1%
 
4073< 0.1%
 
4084< 0.1%
 
4094< 0.1%
 
ValueCountFrequency (%) 
298831< 0.1%
 
298821< 0.1%
 
298811< 0.1%
 
298801< 0.1%
 
298791< 0.1%
 

ShipToAddressID
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count19119
Unique (%)60.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean18249.192563165423
Minimum9
Maximum29883
Zeros0
Zeros (%)0.0%
Memory size245.8 KiB

Quantile statistics

Minimum9
5-th percentile687
Q114063
median19438
Q324672
95-th percentile28859.8
Maximum29883
Range29874
Interquartile range (IQR)10609

Descriptive statistics

Standard deviation8218.429263
Coefficient of variation (CV)0.4503448158
Kurtosis-0.02121916664
Mean18249.19256
Median Absolute Deviation (MAD)5305
Skewness-0.8231416425
Sum574210844
Variance67542579.55
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
13550280.1%
 
25073280.1%
 
29638270.1%
 
13890270.1%
 
14444270.1%
 
25057270.1%
 
12222270.1%
 
22168270.1%
 
24521270.1%
 
27756270.1%
 
Other values (19109)3119399.1%
 
ValueCountFrequency (%) 
98< 0.1%
 
111< 0.1%
 
122< 0.1%
 
131< 0.1%
 
151< 0.1%
 
ValueCountFrequency (%) 
298831< 0.1%
 
298821< 0.1%
 
298811< 0.1%
 
298801< 0.1%
 
298791< 0.1%
 

ShipMethodID
Categorical

HIGH CORRELATION

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size245.8 KiB
1
27659
5
 
3806
ValueCountFrequency (%) 
12765987.9%
 
5380612.1%
 

Length

Max length1
Median length1
Mean length1
Min length1

CreditCardID
Real number (ℝ≥0)

MISSING

Distinct count18384
Unique (%)60.6%
Missing1131
Missing (%)3.6%
Infinite0
Infinite (%)0.0%
Mean9684.100448341795
Minimum1.0
Maximum19237.0
Zeros0
Zeros (%)0.0%
Memory size245.8 KiB

Quantile statistics

Minimum1
5-th percentile983
Q14894.25
median9719.5
Q314510.75
95-th percentile18323
Maximum19237
Range19236
Interquartile range (IQR)9616.5

Descriptive statistics

Standard deviation5566.299591
Coefficient of variation (CV)0.5747874695
Kurtosis-1.20108221
Mean9684.100448
Median Absolute Deviation (MAD)4808.5
Skewness-0.009064604013
Sum293757503
Variance30983691.14
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
11843280.1%
 
11334280.1%
 
6169270.1%
 
7336270.1%
 
10261270.1%
 
9006270.1%
 
14162270.1%
 
2233270.1%
 
18822270.1%
 
9387270.1%
 
Other values (18374)3006295.5%
 
(Missing)11313.6%
 
ValueCountFrequency (%) 
12< 0.1%
 
23< 0.1%
 
31< 0.1%
 
42< 0.1%
 
53< 0.1%
 
ValueCountFrequency (%) 
192371< 0.1%
 
192361< 0.1%
 
192352< 0.1%
 
192343< 0.1%
 
192331< 0.1%
 

CreditCardApprovalCode
Categorical

HIGH CARDINALITY
MISSING
UNIFORM

Distinct count30334
Unique (%)100.0%
Missing1131
Missing (%)3.6%
Memory size245.8 KiB
743497Vi85247
 
1
57362Vi16259
 
1
1235631Vi67842
 
1
1225158Vi43093
 
1
731695Vi619
 
1
Other values (30329)
30329
ValueCountFrequency (%) 
743497Vi852471< 0.1%
 
57362Vi162591< 0.1%
 
1235631Vi678421< 0.1%
 
1225158Vi430931< 0.1%
 
731695Vi6191< 0.1%
 
234884Vi246271< 0.1%
 
840648Vi248191< 0.1%
 
48008Vi751271< 0.1%
 
317373Vi763121< 0.1%
 
920339Vi421421< 0.1%
 
Other values (30324)3032496.4%
 
(Missing)11313.6%
 

Length

Max length15
Median length13
Mean length12.65215319
Min length3

CurrencyRateID
Real number (ℝ≥0)

MISSING

Distinct count2514
Unique (%)18.0%
Missing17489
Missing (%)55.6%
Infinite0
Infinite (%)0.0%
Mean9191.499570692617
Minimum2.0
Maximum12431.0
Zeros0
Zeros (%)0.0%
Memory size245.8 KiB

Quantile statistics

Minimum2
5-th percentile2226.5
Q18510
median10074
Q311282
95-th percentile12208
Maximum12431
Range12429
Interquartile range (IQR)2772

Descriptive statistics

Standard deviation2945.170095
Coefficient of variation (CV)0.3204232424
Kurtosis1.168296897
Mean9191.499571
Median Absolute Deviation (MAD)1309
Skewness-1.38530835
Sum128460398
Variance8674026.891
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
9755440.1%
 
12098420.1%
 
8743390.1%
 
11757390.1%
 
10767390.1%
 
11086390.1%
 
10085370.1%
 
9084370.1%
 
11084320.1%
 
4728310.1%
 
Other values (2504)1359743.2%
 
(Missing)1748955.6%
 
ValueCountFrequency (%) 
21< 0.1%
 
48< 0.1%
 
81< 0.1%
 
153< 0.1%
 
282< 0.1%
 
ValueCountFrequency (%) 
124312< 0.1%
 
124289< 0.1%
 
124268< 0.1%
 
124209< 0.1%
 
124177< 0.1%
 

SubTotal
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count4747
Unique (%)15.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3491.0656730939136
Minimum1.374
Maximum163930.3943
Zeros0
Zeros (%)0.0%
Memory size245.8 KiB

Quantile statistics

Minimum1.374
5-th percentile8.99
Q156.97
median782.99
Q32366.96
95-th percentile18060.72568
Maximum163930.3943
Range163929.0203
Interquartile range (IQR)2309.99

Descriptive statistics

Standard deviation11093.45254
Coefficient of variation (CV)3.177669392
Kurtosis40.40220951
Mean3491.065673
Median Absolute Deviation (MAD)755.71
Skewness5.822374024
Sum109846381.4
Variance123064689.2
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
3578.2715514.9%
 
2181.56257582.4%
 
782.995931.9%
 
39.985781.8%
 
4.995761.8%
 
2049.09825541.8%
 
2443.355431.7%
 
2071.41965211.7%
 
7.283591.1%
 
1000.43753571.1%
 
Other values (4737)2507579.7%
 
ValueCountFrequency (%) 
1.3741< 0.1%
 
2.291390.4%
 
2.7482< 0.1%
 
3.99950.3%
 
4.995761.8%
 
ValueCountFrequency (%) 
163930.39431< 0.1%
 
160378.39131< 0.1%
 
150837.43871< 0.1%
 
147390.93281< 0.1%
 
146154.56531< 0.1%
 

TaxAmt
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count4745
Unique (%)15.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean323.7557432130939
Minimum0.1099
Maximum17948.5186
Zeros0
Zeros (%)0.0%
Memory size245.8 KiB

Quantile statistics

Minimum0.1099
5-th percentile0.7192
Q14.5576
median62.6392
Q3189.5976
95-th percentile1774.88626
Maximum17948.5186
Range17948.4087
Interquartile range (IQR)185.04

Descriptive statistics

Standard deviation1085.05418
Coefficient of variation (CV)3.351459248
Kurtosis42.1117139
Mean323.7557432
Median Absolute Deviation (MAD)60.6008
Skewness5.908727871
Sum10186974.46
Variance1177342.573
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
286.261615514.9%
 
174.5257582.4%
 
62.63925931.9%
 
3.19845781.8%
 
0.39925761.8%
 
163.92795541.8%
 
195.4685431.7%
 
165.71365211.7%
 
0.58243591.1%
 
80.0353571.1%
 
Other values (4735)2507579.7%
 
ValueCountFrequency (%) 
0.10991< 0.1%
 
0.18321390.4%
 
0.21982< 0.1%
 
0.3192950.3%
 
0.39925761.8%
 
ValueCountFrequency (%) 
17948.51861< 0.1%
 
16487.79881< 0.1%
 
14990.65161< 0.1%
 
14587.54131< 0.1%
 
14380.32981< 0.1%
 

Freight
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count4744
Unique (%)15.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean101.17369304941998
Minimum0.0344
Maximum5608.9121
Zeros0
Zeros (%)0.0%
Memory size245.8 KiB

Quantile statistics

Minimum0.0344
5-th percentile0.2248
Q11.4243
median19.5748
Q359.2493
95-th percentile554.65196
Maximum5608.9121
Range5608.8777
Interquartile range (IQR)57.825

Descriptive statistics

Standard deviation339.0794267
Coefficient of variation (CV)3.351458432
Kurtosis42.11171484
Mean101.173693
Median Absolute Deviation (MAD)18.9378
Skewness5.908727926
Sum3183430.252
Variance114974.8576
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
89.456815514.9%
 
54.53917582.4%
 
19.57485931.9%
 
0.99955781.8%
 
0.12485761.8%
 
51.22755541.8%
 
61.08385431.7%
 
51.78555211.7%
 
0.1823591.1%
 
25.01093571.1%
 
Other values (4734)2507579.7%
 
ValueCountFrequency (%) 
0.03441< 0.1%
 
0.05731390.4%
 
0.06872< 0.1%
 
0.0998950.3%
 
0.12485761.8%
 
ValueCountFrequency (%) 
5608.91211< 0.1%
 
5152.43711< 0.1%
 
4684.57861< 0.1%
 
4558.60671< 0.1%
 
4493.85311< 0.1%
 

TotalDue
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count4754
Unique (%)15.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3915.9951093564277
Minimum1.5183
Maximum187487.825
Zeros0
Zeros (%)0.0%
Memory size245.8 KiB

Quantile statistics

Minimum1.5183
5-th percentile9.934
Q162.9519
median865.204
Q32615.4908
95-th percentile20430.54604
Maximum187487.825
Range187486.3067
Interquartile range (IQR)2552.5389

Descriptive statistics

Standard deviation12515.46271
Coefficient of variation (CV)3.195985277
Kurtosis40.56428029
Mean3915.995109
Median Absolute Deviation (MAD)835.0596
Skewness5.830758838
Sum123216786.1
Variance156636806.9
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
3953.988415514.9%
 
2410.62667582.4%
 
865.2045931.9%
 
44.17795781.8%
 
5.5145761.8%
 
2264.25365541.8%
 
2699.90185431.7%
 
2288.91875211.7%
 
8.04443591.1%
 
1105.48343571.1%
 
Other values (4744)2507579.7%
 
ValueCountFrequency (%) 
1.51831< 0.1%
 
2.53051390.4%
 
3.03652< 0.1%
 
4.409950.3%
 
5.5145761.8%
 
ValueCountFrequency (%) 
187487.8251< 0.1%
 
182018.62721< 0.1%
 
170512.66891< 0.1%
 
166537.08081< 0.1%
 
165028.74821< 0.1%
 

Comment
Unsupported

MISSING
REJECTED
UNSUPPORTED

Missing31465
Missing (%)100.0%
Memory size245.9 KiB

rowguid
Categorical

UNIQUE

Distinct count31465
Unique (%)100.0%
Missing0
Missing (%)0.0%
Memory size245.8 KiB
F34A07B0-6E8C-474B-A2E0-12515653DC26
 
1
DC158DF8-916A-4922-A691-3F41A3E8A371
 
1
B6CFD848-1E5C-42E7-BEAA-00592F76ECF8
 
1
9E875A7E-1A63-4E29-9E31-C67B9C5E0ECD
 
1
191DC789-FEF7-4F70-B938-F5A5CD7968D3
 
1
Other values (31460)
31460
ValueCountFrequency (%) 
F34A07B0-6E8C-474B-A2E0-12515653DC261< 0.1%
 
DC158DF8-916A-4922-A691-3F41A3E8A3711< 0.1%
 
B6CFD848-1E5C-42E7-BEAA-00592F76ECF81< 0.1%
 
9E875A7E-1A63-4E29-9E31-C67B9C5E0ECD1< 0.1%
 
191DC789-FEF7-4F70-B938-F5A5CD7968D31< 0.1%
 
4FFF56E8-6B0D-4FD3-8E47-672988B8AB2B1< 0.1%
 
A68AF604-C054-406C-A074-60CEA00E3E611< 0.1%
 
1A7A7B30-EA59-4CE8-A85E-AC6114E5DE1E1< 0.1%
 
C9F72640-443B-419E-AB41-4F5C47B2DEAC1< 0.1%
 
0518E6FC-D0A4-4CAA-B9D1-A968580A665C1< 0.1%
 
Other values (31455)31455> 99.9%
 

Length

Max length36
Median length36
Mean length36
Min length36
Distinct count1124
Unique (%)3.6%
Missing0
Missing (%)0.0%
Memory size245.8 KiB
Minimum2011-06-07 00:00:00
Maximum2014-07-07 00:00:00
Histogram

Interactions

Correlations

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.

Missing values

Sample

First rows

SalesOrderIDRevisionNumberOrderDateDueDateShipDateStatusOnlineOrderFlagSalesOrderNumberPurchaseOrderNumberAccountNumberCustomerIDSalesPersonIDTerritoryIDBillToAddressIDShipToAddressIDShipMethodIDCreditCardIDCreditCardApprovalCodeCurrencyRateIDSubTotalTaxAmtFreightTotalDueCommentrowguidModifiedDate
04365982011-05-312011-06-122011-06-075FalseSO43659PO52214578710-4020-00067629825279.05985985516281.0105041Vi84182NaN20565.62061971.5149616.098423153.2339NaN79B65321-39CA-4115-9CBA-8FE0903E12E62011-06-07
14366082011-05-312011-06-122011-06-075FalseSO43660PO1885012750010-4020-00011729672279.0592192155618.0115213Vi29411NaN1294.2529124.248338.82761457.3288NaN738DC42D-D03B-48A1-9822-F95A67EA73892011-06-07
24366182011-05-312011-06-122011-06-075FalseSO43661PO1847318962010-4020-00044229734282.0651751751346.085274Vi68544.032726.47863153.7696985.553036865.8012NaND91B9131-18A4-4A11-BC3A-90B6F53E9D742011-06-07
34366282011-05-312011-06-122011-06-075FalseSO43662PO1844417404410-4020-00022729994282.06482482510456.0125295Vi539354.028832.52892775.1646867.238932474.9324NaN4A1ECFC0-CC3A-4740-B028-1C50BB48711C2011-06-07
44366382011-05-312011-06-122011-06-075FalseSO43663PO1800918647010-4020-00051029565276.041073107354322.045303Vi22691NaN419.458940.268112.5838472.3108NaN9B1E7A40-6AE0-4AD3-811C-A64951857C4B2011-06-07
54366482011-05-312011-06-122011-06-075FalseSO43664PO1661712198310-4020-00039729898280.018768765806.095555Vi4081NaN24432.60882344.9921732.810027510.4109NaN22A8A5DA-8C22-42AD-9241-839489B6EF0D2011-06-07
64366582011-05-312011-06-122011-06-075FalseSO43665PO1658819157210-4020-00014629580283.01849849515232.035568Vi78804NaN14352.77131375.9427429.982116158.6961NaN5602C304-853C-43D7-9E79-76E320D476CF2011-06-07
74366682011-05-312011-06-122011-06-075FalseSO43666PO1600817388310-4020-00051130052276.0410741074513349.0105623Vi69217NaN5056.4896486.3747151.99215694.8564NaNE2A90057-1366-4487-8A7E-8085845FF7702011-06-07
84366782011-05-312011-06-122011-06-075FalseSO43667PO1542813259910-4020-00064629974277.03629629510370.055680Vi53503NaN6107.0820586.1203183.16266876.3649NaN86D5237D-432D-4B21-8ABC-671942F5789D2011-06-07
94366882011-05-312011-06-122011-06-075FalseSO43668PO1473218029510-4020-00051429614282.0652952951566.085817Vi80454.035944.15623461.76541081.801740487.7233NaN281CC355-D538-494E-9B44-461B36A826C62011-06-07

Last rows

SalesOrderIDRevisionNumberOrderDateDueDateShipDateStatusOnlineOrderFlagSalesOrderNumberPurchaseOrderNumberAccountNumberCustomerIDSalesPersonIDTerritoryIDBillToAddressIDShipToAddressIDShipMethodIDCreditCardIDCreditCardApprovalCodeCurrencyRateIDSubTotalTaxAmtFreightTotalDueCommentrowguidModifiedDate
314557511482014-06-302014-07-122014-07-075TrueSO75114NaN10-4030-02470424704NaN8249322493214318.0227542Vi22680NaN21.491.71920.537323.7465NaN1BE20D6F-497C-4D6A-97ED-BFFEA6310E152014-07-07
314567511582014-06-302014-07-122014-07-075TrueSO75115NaN10-4030-02683226832NaN8136181361814360.0327761Vi22867NaN80.476.43762.011888.9194NaN7883FF0C-9E63-4AA3-9B08-342476B19A0D2014-07-07
314577511682014-06-302014-07-122014-07-075TrueSO75116NaN10-4030-01640216402NaN102739227392115564.0728746Vi80423NaN4.990.39920.12485.5140NaN0B91256B-5206-4B0C-86DE-FE37E394798E2014-07-07
314587511782014-06-302014-07-122014-07-075TrueSO75117NaN10-4030-01817818178NaN102552225522118150.0928953Vi94119NaN29.482.35840.737032.5754NaN90A7DC3B-0848-4EE4-9110-8EBC46A2DEF02014-07-07
314597511882014-06-302014-07-122014-07-075TrueSO75118NaN10-4030-01367113671NaN82743927439110513.0828969Vi54175NaN135.2310.81843.3808149.4292NaNFDE750F5-95C9-4D63-AADC-807940A0FAFA2014-07-07
314607511982014-06-302014-07-122014-07-075TrueSO75119NaN10-4030-01198111981NaN1176491764916761.0429826Vi35166NaN42.283.38241.057046.7194NaN9382F1C9-383A-435F-9449-0EECEA21B78D2014-07-07
314617512082014-06-302014-07-122014-07-075TrueSO75120NaN10-4030-01874918749NaN6283742837418925.0929849Vi46003NaN84.966.79682.124093.8808NaNAE6A4FCF-FF73-4CD4-AF2C-5993D00D4AFE2014-07-07
314627512182014-06-302014-07-122014-07-075TrueSO75121NaN10-4030-01525115251NaN62655326553114220.0529864Vi73738NaN74.985.99841.874582.8529NaND7395C0E-00CB-4BFA-A238-0D6A9F49884F2014-07-07
314637512282014-06-302014-07-122014-07-075TrueSO75122NaN10-4030-01586815868NaN61461614616118719.0330022Vi97312NaN30.972.47760.774334.2219NaN4221035A-4159-492F-AF40-4363A64FFC162014-07-07
314647512382014-06-302014-07-122014-07-075TrueSO75123NaN10-4030-01875918759NaN61402414024110084.0230370Vi51970NaN189.9715.19764.7493209.9169NaND54752FF-2B54-4BE5-95EA-3B72289C059F2014-07-07